PYROSEQUENCING METHODS AND RELATED COMPOSITIONS

This application claims benefit of U.S. Provisional
Application No. 60/692,816, filed June 21, 2005, the
contents of which are hereby incorporated by reference. The
invention disclosed herein was made with government support
under a grant from the Center for Excellence in Genomic
Science Grant No. P50 HG002806. Accordingly, the U.S.
Government has certain rights in this invention.

Throughout this application, various publications are
referenced in parentheses by number. Full citations for
these references may be found at the end of the
specification immediately preceding the claims. The
disclosures of these publications in their entireties are
hereby incorporated by reference into this application to
more fully describe the state of the art to which this
invention pertains.

Background of the Invention 

Pyrosequencing is a method based on the detection of the
pyrophosphate group that is generated when a nucleotide is
incorporated in a DNA polymerase reaction [1]. Each of the
four deoxynucleotides (dNTPs) is added sequentially to the
DNA template to be sequenced with a cocktail of enzymes and
substrates in addition to the usual polymerase reaction
components. If the added nucleotide is complementary with
the first available base on the template, the nucleotide
will be incorporated and a pyrophosphate will be released.
The released pyrophosphate is converted to ATP by
sulfurylase, and this ATP is the substrate for a
luciferase, e.g. firefly luciferase, which reaction
produces visible light. If the added nucleotide is not
incorporated, no light will be produced and the nucleotide
will simply be degraded by the enzyme apyrase. This
pyrosequencing technique, schematized in Fig. 1, has been
applied to single nucleotide polymorphism (SNP) detection
and other applications [2].



There are, however, inherent difficulties in the
traditional pyrosequencing method for determining the
number of incorporated nucleotides in homopolymeric regions
(e.g. a string of several T's in a row) of the template.
Moreover, dATP greatly interferes with the luciferase
detection system, which is deficient in the detection of
dATP.
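The homopolymer difficulty can be illustrated with a minimal sketch. It assumes an idealized model in which every dispensation of an unblocked dNTP is incorporated across an entire run of matching bases, so that a single light signal must be read quantitatively to recover the run length. The function name, the flow order and the example sequence are hypothetical and are not taken from this application.

```python
def conventional_flowgram(new_strand, flow_order="ACGT"):
    """Toy model of conventional pyrosequencing with unblocked dNTPs.

    new_strand is the sequence the polymerase will synthesize (5'->3').
    Returns one (dispensed_base, incorporations) pair per dispensation.
    """
    if any(base not in flow_order for base in new_strand):
        raise ValueError("new_strand contains a base outside flow_order")
    flows, pos, step = [], 0, 0
    while pos < len(new_strand):
        base = flow_order[step % len(flow_order)]
        run = 0
        # An unblocked dNTP keeps being incorporated along a homopolymer run.
        while pos < len(new_strand) and new_strand[pos] == base:
            run += 1
            pos += 1
        flows.append((base, run))
        step += 1
    return flows


for base, incorporations in conventional_flowgram("GATTTTC"):
    print(base, incorporations)
```

In this idealized model the run of four T's produces one dispensation with four incorporations, so the length of the run has to be inferred from signal intensity alone.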



Summary of the Invention

This invention provides a method for determining the
nucleotide sequence of a single-stranded DNA comprising
performing the following steps for each nucleic acid
residue of the DNA whose identity is to be determined:
(a) contacting the DNA under DNA polymerization-permitting
conditions with (i) a 3'-O-blocked dNTP selected from
the group consisting of 3'-O-blocked dATP, 3'-O-blocked
dCTP, 3'-O-blocked dGTP, and 3'-O-blocked dTTP, and
(ii) 9°N DNA polymerase (exo-) A485L/Y409V or another
DNA polymerase able to incorporate 3'-O-blocked dNTPs;
(b) (i) determining whether pyrophosphate is generated as
a result of step (a), whereby (1) pyrophosphate
generation indicates that polymerization has occurred
and the identity of the nucleic acid residue in the
DNA is that which is complementary to the 3'-O-blocked
dNTP used in part (i) of step (a), and (2) the absence
of pyrophosphate generation indicates that the
identity of such nucleic acid residue is not that
which is complementary to such 3'-O-blocked dNTP, and
(ii) if pyrophosphate is not generated, repeating step
(a) once, twice or three times as necessary, wherein
in each repetition a 3'-O-blocked dNTP is used which
is different from any 3'-O-blocked dNTP already used,
and determining, after each repetition of step (a),
whether pyrophosphate is generated, such generation
indicating that polymerization has occurred and the
identity of the nucleic acid residue in the DNA is
that which is complementary to the 3'-O-blocked dNTP
used in part (i) of the repeated step (a); and
(c) removing from the 3'-O-blocked dNTP polymerized in
step (a) or (b), whichever is applicable, the moiety
blocking the 3'-O atom of the dNTP, with the proviso
that such removing step is optional in the event that
there remains no further nucleic acid residue of the
DNA whose identity is to be determined.
Brief Description of the Figures

Figure 1: Schematic of pyrosequencing in solution.
Reactants not shown are APS, which with PPi is
converted to ATP and SO4(2-) by ATP-sulfurylase. In
addition luciferase acts on ATP, luciferin and O2
to give AMP, PPi, oxyluciferin, CO2 and light,
and apyrase converts ATP and dNTP to AMP, dNMP
and 2 Pi.

Figure 2: 3'-O-Allyl-dNTP (A, C, G, T), instead of dNTP, is
used in the single base extension on a solid
surface. Four Allyl-dNTPs are added iteratively.
Once the complementary base is incorporated, the
pyrophosphate that is produced from the reaction
can be detected by its reaction with the
light-generating luciferase system or a
receptor-indicator (R-I) coordination compound via the
release of the fluorescent indicator molecule.
Then the extended primers can be deallylated,
washed and reused in the next round. The use of
an allyl group solves inherent problems of
traditional pyrosequencing.

Figures 3A-3C: Schematic representation and step-by-step
MALDI-TOF MS results for the deallylation of an
allyl-modified oligonucleotide (SEQ ID NO:1) and
the use of the deallylated DNA product as a
primer in a polymerase extension reaction. (A)
Peak at m/z 5871 corresponding to the HPLC-purified
3'-allyloxy 19-mer oligonucleotide. (B)
Peak at m/z 5831 corresponding to the above DNA
product without the allyl group, obtained after
30 secs of incubation with the Na2PdCl4 catalyst
and the TPPTS (P(PhSO3Na)3) ligand at 70 °C. (C)
Peak at m/z 6535 corresponding to the extension
of the deallylated DNA product by Biotin-ddGTP
using Thermo Sequenase DNA Polymerase.

Figure 4: Synthesis of a 3'-O-allyl-modified
oligonucleotide.

Figures 5A & 5B: MALDI-TOF MS spectra showing the
incorporation of 3'-O-allyl-modified dTTP into a
growing DNA strand by 9°N Polymerase (exo-)
A485L/Y409V; (A) unextended primer at m/z 5526;
(B) primer extended with 3'-O-allyl-dTTP at m/z
5869.

Figures 6A & 6B: (A) Receptor (R): Zn2+-dipicolylamine
(Zn2+DPA); (B) Indicator (I): fluorescent
molecule (coumarin-derived indicator). See [5].

Figures 7A & 7B: When R is titrated into I, the
fluorescence intensity of I will decrease: (A)
Zn2+DPA is titrated into (10 µM) I; (B) PPi is
added to R-I coordination compound.

Figure 8: Ronaghi's real-time pyrosequencing.

Figure 9: Improved real-time pyrosequencing method.

Figure 10: Structures of four reversibly-blocked
nucleotides.

Figure 11: Mass spectrometry traces showing
incorporation of four different reversibly-blocked
allyl-dNTPs into a growing DNA strand in
the solution phase.

Figure 12: Polymerase extension reaction with
3'-O-allyl-dGTP-allyl-bodipy-FL-510 as a reversible
terminator of SEQ ID NO:2.

Figure 13: Experimental results of pyrosequencing a DNA
template (SEQ ID NO:3) in solution with allyl-dGTP
and comparison with 'regular' unblocked
nucleotides. The results indicate that allyl-dGTP
is a good terminator in solution phase, and the
incorporation signal can be easily detected.

Figure 14: An experimental scheme of a method employing
allyl-dGTP for pyrosequencing with attachment of
the primer (SEQ ID NO:4 and SEQ ID NO:5) to a
solid surface/bead using an NHS ester.

Figure 15: Comparison of pyrosequencing using 'regular'
dNTPs and pyrosequencing using reversibly-blocked
dNTPs (SEQ ID NO:3).

Figure 16: Pyrosequencing data using reversible
terminators on sepharose bead immobilized looped
primer-DNA (SEQ ID NO:3).



Figure 17: Light production by luciferase in the
presence of dATP and in the presence of allyl-dATP,
demonstrating that allyl-dATP is not a
luciferase substrate.

Figure 18: Technique of immobilizing double-stranded
DNA (SEQ ID NO:6 (top strand) and SEQ ID NO:7
(lower strand)) to a derivatized bead and
pyrosequencing using "normal" nucleotides.

Figure 19: Pyrosequencing on sepharose bead-immobilized
DNA (SEQ ID NO:6 (top strand) and SEQ ID NO:7
(lower strand)) using Allyl-dNTPs.
Detailed Description of the Invention



Definitions 



As used herein, and unless stated otherwise, each of the
following terms shall have the definition set forth below.

PPi - pyrophosphate
dNTP - deoxynucleoside 5'-triphosphate - also known as
a deoxynucleotide
APS - adenosine 5'-phosphosulfate
ATP - adenosine 5'-triphosphate
dATP - deoxyadenosine 5'-triphosphate
THF - tetrahydrofuran
TEAB - tetraethylammonium bromide
TPPTS - trisodium salt of tri(m-sulfophenyl)-phosphine

"Nucleic acid" shall mean any nucleic acid molecule,
including, without limitation, DNA, RNA and hybrids
thereof. The nucleic acid bases that form nucleic acid
molecules can be the bases A, C, G, T and U, as well as
derivatives thereof. Derivatives of these bases are well
known in the art, and are exemplified in PCR Systems,
Reagents and Consumables (Perkin Elmer Catalogue 1996-1997,
Roche Molecular Systems, Inc., Branchburg, New Jersey,
USA).



Embodiments of the Invention 

Disclosed here is a method that solves the problems of
homopolymeric regions and dATP interference by using
3'-O-allyl-nucleotides as reversible terminators in
pyrosequencing using either a new PPi detection system
(Chemosensing Ensemble), or the traditional luciferase
detection technique.

Specifically, this invention provides a method for
determining the nucleotide sequence of a single-stranded
DNA comprising performing the following steps for each
nucleic acid residue of the DNA whose identity is to be
determined:

(a) contacting the DNA under DNA polymerization-permitting
conditions with (i) a 3'-O-blocked dNTP selected from
the group consisting of 3'-O-blocked dATP, 3'-O-blocked
dCTP, 3'-O-blocked dGTP, and 3'-O-blocked dTTP, and
(ii) 9°N DNA polymerase (exo-) A485L/Y409V
or other DNA polymerase;

(b) (i) determining whether pyrophosphate is generated as
a result of step (a), whereby (1) pyrophosphate
generation indicates that polymerization has occurred
and the identity of the nucleic acid residue in the
DNA is that which is complementary to the 3'-O-blocked
dNTP used in part (i) of step (a), and (2) the absence
of pyrophosphate generation indicates that the
identity of such nucleic acid residue is not that
which is complementary to such 3'-O-blocked dNTP, and
(ii) if pyrophosphate is not generated, repeating step
(a) once, twice or three times as necessary (i.e.
until pyrophosphate is generated), wherein in each
repetition a 3'-O-blocked dNTP is used which is
different from any 3'-O-blocked dNTP already used, and
determining, after each repetition of step (a),
whether pyrophosphate is generated, such generation
indicating that polymerization has occurred and the
identity of the nucleic acid residue in the DNA is
that which is complementary to the 3'-O-blocked dNTP
used in part (i) of the repeated step (a); and

(c) removing from the 3'-O-blocked dNTP polymerized in
step (a) or (b), whichever is applicable, the moiety
blocking the 3'-O atom of the dNTP, with the proviso
that such removing step is optional in the event that
there remains no further nucleic acid residue of the
DNA whose identity is to be determined.



The identity of a nucleic acid residue in the DNA being
sequenced is that which is complementary to the
3'-O-blocked dNTP incorporated, i.e. such identity is determined
by the well-established complementary base-pairing rules.
For example, if a 3'-O-blocked dATP is incorporated, then
the corresponding nucleic acid residue in the DNA being
sequenced is a thymine. If a 3'-O-blocked dGTP is
incorporated, then the corresponding nucleic acid residue
in the DNA being sequenced is a cytosine, and so forth with
the understanding that adenine and thymine are complements
of each other, and guanine and cytosine are complements of
each other. In addition, uridine is a complement of
adenine.
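The cycle of steps (a) through (c), together with the base-pairing rule, can be sketched as follows. This is a minimal illustration only: pyrophosphate detection is simulated by a base-pairing check, deblocking is treated as always complete, and the function name, the order in which the blocked dNTPs are tried and the example template are hypothetical.

```python
# Watson-Crick complements for the four DNA bases.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def sequence_with_blocked_dntps(template, order="ACGT"):
    """Simulate the 3'-O-blocked dNTP cycle on an idealized template.

    template lists the residues to be identified, in the order in which
    the polymerase encounters them. Returns (called_sequence, additions).
    """
    calls = []
    additions = 0
    for residue in template:
        # Step (a), repeated up to three more times per step (b)(ii).
        for dntp in order:
            additions += 1
            # Step (b)(i): PPi is released only if the dNTP pairs with
            # the residue; because the 3'-O is blocked, exactly one
            # nucleotide can be incorporated.
            ppi_released = COMPLEMENT[dntp] == residue
            if ppi_released:
                calls.append(COMPLEMENT[dntp])
                break
        # Step (c): the blocking moiety would be removed here before
        # the next residue is interrogated.
    return "".join(calls), additions


called, additions = sequence_with_blocked_dntps("AAAAG")
print(called, additions)
```

Because only one blocked nucleotide can be added per cycle, each residue of the run of four A's is called in its own cycle and no signal intensity has to be interpreted quantitatively.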

A 3'-O-blocked deoxynucleotide is a deoxynucleotide having
attached to the 3' oxygen of its sugar component a chemical
group, for example an allyl group, that precludes further
polymerization from the 3' oxygen until that blocking group
is removed.

This invention further provides the instant method, wherein
determining whether pyrophosphate is generated in step (b)(i)
is performed by detecting light generated by a
luciferase-based reaction. In one embodiment, the luciferase is
firefly luciferase. In another embodiment, the
luciferase-based reaction comprises contacting the pyrophosphate with
a sulfurylase under conditions permitting the generation of
ATP from the pyrophosphate, and contacting the ATP so
generated with a luciferase under conditions permitting the
generation of light by the luciferase in the presence of
ATP. A luciferase-based reaction includes, for example, the
reaction of luciferin and ATP in the presence of luciferase
and O2, whereby oxyluciferin, AMP, PPi, CO2, and light are
produced. The light produced can be measured by any
standard photometry technique including, but not limited
to, photomultiplier, video, CCD, CCCD, and the naked eye.

In a preferred embodiment, the moiety blocking the 3'-O
atom of the dNTP is an allyl moiety and the single-stranded
DNA is immobilized to a solid substrate.

In another embodiment, determining whether pyrophosphate is
generated in step (b)(i) is performed by detecting
dissociation of a coumarin-derived indicator from a complex
between the indicator and a bis-Zn2+-dipicolylamine
coordination compound, wherein the coumarin-derived
indicator has the following structure:

[Structure drawing not reproduced.]

and the bis-Zn2+-dipicolylamine coordination compound, when
in association with the coumarin-derived indicator, has the
following structure:

[Structure drawing not reproduced; counterion shown as 4NO3-.]



In the preferred embodiment, the moiety blocking the 3'-O
atom of the dNTP is an allyl moiety and the single-stranded
DNA is immobilized to a solid substrate. In another
embodiment the moiety blocking the 3'-O atom of the dNTP is
an allyl moiety or a methoxymethyl moiety. Preferably, the
moiety is an allyl moiety.

In the preferred embodiment of the instant method, the DNA
is immobilized on a solid substrate. In different
embodiments, the DNA is bound to the solid substrate via an
azido linkage, an alkynyl linkage, a 1,3-dipolar
cycloaddition linkage, or biotin-streptavidin interaction.
The solid substrate can be, for example, in the form of a
chip, a bead, a well, a capillary tube, or a slide. Also,
for example, the solid substrate can be gold, quartz,
silica, or plastic. In one embodiment of this invention,
the solid substrate is porous.

Single-stranded DNA can be immobilized on a solid surface,
for example a glass surface, by a 1,3-dipolar cycloaddition
reaction in the presence of a Cu(I) catalyst. The DNA is
labeled with an azido group at the 5' end, while the glass
surface is modified by an alkynyl group. After the
1,3-dipolar cycloaddition between the azido and the alkynyl
group in the presence of a Cu(I) catalyst at room
temperature, the DNA is covalently attached to the surface
via a stable 1,2,3-triazole linkage. The positions of the
azido and the alkynyl functional groups are
interchangeable. The resulting 1,2,3-triazoles are stable
under aqueous conditions and at high temperature.

In the preferred embodiment of the instant methods, the
moiety blocking the 3'-O atom of the dNTP is an allyl
moiety and removing it is performed using Na2PdCl4 and
TPPTS.



This invention also provides a compound comprising a dNTP
having bound to its 3' oxygen an allyl or methoxymethyl
moiety. In the preferred embodiment, the moiety is an allyl
moiety. In another embodiment, the moiety is a
methoxymethyl moiety. In specific embodiments the dNTP is
dATP, dCTP, dGTP, or dTTP. In a further embodiment, the
instant compound is a 3'-O-allyl dNTP, and specifically
3'-O-allyl dATP, 3'-O-allyl dCTP, 3'-O-allyl dGTP, 3'-O-allyl
dUTP or 3'-O-allyl dTTP.

Examples of allyl derivatives include, without limitation,
analogs or homologs thereof, or haloallyls such as
iodoallyl, chloroallyl and fluoroallyl which perform as
blocking moieties. Examples of methoxymethyl derivatives
include, without limitation, analogs or homologs thereof
which perform as blocking moieties.



This invention also provides a process for producing a
3'-O-allyl dNTP comprising:

(a) sequentially contacting a dimethoxytrityl (DMTr) 3'
protected nucleoside triphosphate with (i) 3-bromopropene,
NaOH and benzene, and (ii) a suitable
solvent; and
(b) sequentially contacting the product of step (a) with
(i) POCl3/(MeO)3P(O), (ii) tributylammonium
pyrophosphate, and (iii) TEAB/NH4OH, so as to produce
the 3'-O-allyl dNTP.

In one embodiment of the instant method, the suitable
solvent of step (a)(ii) is 3% THF/CHCl3. In another
embodiment, the concentration of TEAB in step (b)(iii) is
about 0.1 M.

Finally, this invention provides a kit for use in
sequencing a single-stranded DNA comprising:

(a) 3'-O-allyl dATP, 3'-O-allyl dCTP, 3'-O-allyl dTTP, and
3'-O-allyl dGTP, each in a separate compartment; and
(b) instructions for use.

In various embodiments, the instant kit further comprises
(i) a 9°N DNA polymerase (exo-) A485L/Y409V, (ii) reagents
permitting DNA polymerization, (iii) reagents permitting
pyrophosphate detection using a luciferase-based reaction,
(iv) reagents permitting pyrophosphate detection using a
coumarin-derived indicator, and/or (v) reagents permitting
removal of an allyl group from a 3'-O-allyl dNTP.

In differing embodiments, the 3'-O-allyl-modified dNTP has
one of the following structures:

[Structure drawings not reproduced.]

This invention will be better understood by reference to
the Experimental Details which follow, but those skilled in
the art will readily appreciate that the specific
experiments detailed are only illustrative of the invention
as described more fully in the claims which follow
thereafter.

Experimental Details

The general scheme of the improved pyrosequencing method is
shown in Fig. 2. 3'-O-allyl-dNTPs (A, C, G, T), instead of
dNTPs, are used in the single base extension on a solid
surface. Four allyl-dNTPs are added iteratively. Once the
complementary base is incorporated, the pyrophosphate that
is produced from the reaction can be detected by its
reaction with the light-generating luciferase system or a
receptor-indicator (R-I) coordination compound via the
release of the fluorescent indicator molecule. In one case
the R-I compound has negligible or no fluorescence, and the
released/displaced indicator is detectably fluorescent.
Then the extended primers can be deallylated, washed and
reused in the next round. The use of an allyl group solves
inherent problems of traditional pyrosequencing.

Synthesis and Deallylation of 3'-O-Allyl-dNTP and its
Performance in Single Base Extension

A mild condition to remove a 3'-O-allyl group from DNA in
aqueous solution using a catalyst system formed by Na2PdCl4
and a water-soluble ligand
3,3',3''-phosphinidynetris(benzenesulfonic acid), trisodium salt
(TPPTS) [3] has been identified. Using this condition, the
deallylation of the purified 19-mer oligonucleotide
(5'-AGAGGATCCAACCGAGAC-T(allyl)-3') (SEQ ID NO:8) was
established using MALDI-TOF mass spectrometry. In Fig. 3A,
the mass peak at m/z 5871 corresponds to the mass of the
purified oligonucleotide bearing the allyl group. The
deallylation reaction on this oligonucleotide was carried
out using the Na2PdCl4/TPPTS system. Fig. 3B shows near
complete deallylation with a DNA/catalyst/ligand ratio of
1/50/400 in a reaction time of 30 secs, as shown by the
mass peak at m/z 5831.
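As a consistency check on the two reported peaks, the mass shift expected for deallylation can be estimated from atomic masses. The sketch below assumes that deallylation replaces the 3'-O-allyl group (C3H5) with a hydrogen, i.e. a net loss of C3H4, and uses standard average atomic masses; the variable and function names are illustrative only.

```python
# Standard average atomic masses (g/mol).
ATOMIC_MASS = {"C": 12.011, "H": 1.008}


def expected_deallylation_shift():
    """Mass lost when a 3'-O-allyl group (C3H5) is replaced by H."""
    allyl = 3 * ATOMIC_MASS["C"] + 5 * ATOMIC_MASS["H"]
    return allyl - ATOMIC_MASS["H"]  # net loss of C3H4


# Peaks reported in the text for Fig. 3A and the deallylated product.
observed_shift = 5871 - 5831

print(round(expected_deallylation_shift(), 2), observed_shift)
```

Under these assumptions the calculated loss of about 40 mass units agrees with the difference between the peaks at m/z 5871 and m/z 5831.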



The next step was to prove that the deallylated product
could be used in a primer extension reaction and that
deallylation did not hinder the continuation of the
polymerase reaction. A single base extension reaction using
the deallylated product as a primer was performed with a
synthetic template and a Biotin-ddGTP nucleotide terminator
complementary to the base immediately adjacent to the
priming site on the template. The extension product was
isolated using solid phase capture purification and
analyzed using MALDI-TOF MS [4]. The mass spectrum in Fig.
3C shows a clear peak at m/z 6535 corresponding to the
extension product, proving that the deallylated product can
be successfully used as a primer in a polymerase reaction.



These experiments established that Na2PdCl4 and TPPTS could
be used to efficiently carry out deallylation on DNA in an
aqueous environment without the need for an allyl scavenger
or harsh conditions. A next step was to ensure that an
allyl-modified nucleotide could be incorporated in a DNA
Polymerase reaction. For this purpose, a nucleotide
analogue 3'-allyloxythymidine triphosphate
(3'-O-allyl-dTTP) was synthesized (Fig. 4) and its incorporation
ability was tested using a mutant form of 9°N DNA
Polymerase (exo-) bearing the mutations A485L and Y409V.
Results showed that this enzyme could incorporate
3'-O-allyl-dTTP in a polymerase reaction. 3'-O-allyl-dGTP,
3'-O-allyl-dATP and 3'-O-allyl-dCTP can be similarly prepared
according to the scheme set forth in Fig. 4.

The 3'-O-allyl-thymidine triphosphate was used in a primer
extension reaction to demonstrate its ability to be
incorporated into a growing DNA strand by DNA Polymerase.
The extension was performed using a 15-µl reaction mixture
consisting of 50 pmol of an 18-mer primer
(5'-AGA-GGA-TCC-AAC-CGA-GAC-3') (SEQ ID NO:9), 100 pmol of
single-stranded 60-mer DNA template
(5'-GTG-TAC-ATC-AAC-ATC-ACC-TAC-CAC-CAT-GTC-AGT-CTC-GGT-TGG-ATC-CTC-TAT-TGT-GTC-CGG-3')
(SEQ ID NO:10) corresponding to a portion of exon 7 of the
p53 gene, 200 pmol of 3'-O-allyl-thymidine triphosphate, 1X
Thermopol reaction buffer (New England Biolabs) and 15 U of
9°N DNA polymerase (exo-) A485L/Y409V. The extension
reaction consisted of 15 cycles at 94 °C for 20 sec, 48 °C
for 30 sec and 60 °C for 60 sec. The product was desalted
using Zip Tip and analyzed using MALDI-TOF MS. The mass
spectral data are shown in Fig. 5. Fig. 5(A) shows a single
mass peak at m/z 5526 corresponding to the unextended
primer. Fig. 5(B) shows a single peak at m/z 5869
corresponding to the primer extended by a single base
3'-O-allyl-thymidine triphosphate. These data confirm that the
above 3'-allyl-modified nucleotide analogue can be
efficiently incorporated by 9°N DNA polymerase (exo-)
A485L/Y409V.
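The amounts and cycling parameters given above can be converted to working concentrations and a total cycling time with simple arithmetic. The sketch below assumes a final volume of exactly 15 µl and ignores temperature ramp times; the helper names are hypothetical.

```python
def micromolar(pmol, volume_ul):
    """Concentration in µM; 1 pmol per µl equals 1 µM."""
    return pmol / volume_ul


REACTION_VOLUME_UL = 15
amounts_pmol = {
    "primer": 50,
    "template": 100,
    "3'-O-allyl-dTTP": 200,
}

for name, pmol in amounts_pmol.items():
    print(f"{name}: {micromolar(pmol, REACTION_VOLUME_UL):.2f} µM")

# 15 cycles of 20 s + 30 s + 60 s, hold times only.
total_cycling_seconds = 15 * (20 + 30 + 60)
print(total_cycling_seconds / 60, "min")
```

On these assumptions the primer, template and nucleotide are present at roughly 3.3, 6.7 and 13.3 µM respectively, and the hold times add up to 27.5 minutes.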

Single Base Extension on Solid Surface with 3'-Allyl-dNTP
(Click Chemistry)

In order to separate primers from the mixture after SBE and
deallylation, the primers can be immobilized on a solid
surface. One common method is to use paramagnetic beads
which are coated with streptavidin. Primers which are
labeled with biotin can be attached to the beads because of
the biotin-streptavidin attraction. A recently developed
DNA immobilization method using click chemistry [6], hereby
incorporated by reference, can be used in the
pyrosequencing method disclosed here. With the addition of
template, allyl-dNTP and polymerase, the extension can take
place on the beads.

A New PPi Detection System for use in Pyrosequencing
(Chemosensing Ensemble)

Roger reported a fluorescence chemosensing system
(Chemosensing Ensemble) which is described and shown to
selectively detect pyrophosphate under physiological
conditions [5]. Here, pyrophosphate is capable of
displacing a fluorescent coumarin-derived indicator (I)
from a bis-Zn2+-dipicolylamine (Zn2+DPA) coordination
compound (R). See Fig. 6. With an increase of the Receptor
(R) amount, the non-fluorescent R-I coordination compound
is formed. When the proportion of R is 50%, the
fluorescence reaches its lowest, indicating a 1:1
stoichiometry. Once pyrophosphate is added to the solution,
it can displace the Indicator (fluorescent molecule) from
the R-I coordination compound. Therefore, the fluorescent
molecule is released/displaced, and the fluorescence
intensity of the solution will increase (Fig. 7A and B).

Schematically:

PPi + R-I complex      ->   PPi-R + I
     (no fluorescence)           (strong fluorescence)
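The quenching of the indicator on titration with the receptor can be sketched with a simple 1:1 binding model. This is an illustration under stated assumptions only: fluorescence is taken to be proportional to free indicator, the R-I complex is treated as non-fluorescent, the 10 µM indicator concentration is the value given for Fig. 7A, and the dissociation constant is a hypothetical value chosen for illustration rather than a measured one.

```python
import math


def free_indicator(r_total, i_total=10.0, kd=0.01):
    """Free indicator (µM) for 1:1 binding R + I <-> R-I.

    r_total and i_total are total concentrations in µM; kd (µM) is a
    hypothetical dissociation constant, not a value from the text.
    """
    b = r_total + i_total + kd
    # Physically meaningful root of the 1:1 binding quadratic.
    complex_ri = (b - math.sqrt(b * b - 4.0 * r_total * i_total)) / 2.0
    return i_total - complex_ri


for r in (0, 5, 10, 20):
    print(r, round(free_indicator(r), 3))
```

In this model the free (fluorescent) indicator falls steadily as receptor is added and is almost fully bound once one equivalent of receptor is present, which is the behaviour expected for a tightly bound 1:1 complex. Adding PPi would shift the balance back toward free indicator, but that competition step is not modelled here.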

Using 3'-allyl-dNTPs as reversible terminators overcomes
the inherent problem that the pyrosequencing method
otherwise has in accurately detecting the bases in
homopolymeric regions, because each base via this invention
is extended one by one with high fidelity. Meanwhile, the
newly designed PPi detection system is simple to use and is
not affected by dATP. The paradigm of pyrosequencing can be
useful in the presence of automatic sequencing machines
where each step is repeated in cycles.

Improvement upon the Ronaghi Method 

Ronaghi proposed a real-time pyrosequencing method in
solution [1]. In his method, four enzymes are needed. Among
them, sulfurylase is used to convert PPi to ATP; then
luciferase is used to generate light that indicates PPi has
been generated. In the next step apyrase is used to degrade
ATP and excess dNTP in the reaction; then the process goes
to the next round. However, apyrase activity is decreased
in later cycles, which is due to the accumulation of
intermediate products (such as deoxynucleoside diphosphate,
or dNDP) and eventually undegraded dNTP. Because of this
limitation, this method can determine the sequence of only
about 100 bases at most. See Fig. 8.

However, replacing dNTPs in Fig. 8 with the 3'-Allyl-dNTPs
disclosed here, and then following the scheme in Fig. 8,
permits one to unambiguously sequence the DNA using
repeated cycles without the same degradation problems.

The method disclosed here using the R-I complex can greatly
improve the real-time pyrosequencing in the Ronaghi method
(Fig. 9). The R-I complex is used to detect PPi. PPi is
converted to PPi-R, while the released indicator I can be
converted back into the R-I complex by adding R without removing
the components from the solution. The excess dNTP in each
cycle is degraded by apyrase. Because there will be no ATP
produced in the detection steps, apyrase now primarily
degrades dNTP and is more efficient in its action.
Accordingly, more bases can be determined.

Another advantage of this method is that only two kinds of
enzymes are used here rather than four, and the detection
step will not adversely affect the other steps. However,
this improved method cannot detect the bases in
homopolymeric regions either, and so 3'-O-allyl dNTPs are
employed to circumvent this problem.



WO 2007/002204 






PCT/US2006/024157 



References 

1. Ronaghi M., Uhlen M., Nyren P. A sequencing method based
on real-time pyrophosphate. Science 281(5375), 363-365
(1998).

2. Ronaghi M., Karamohamed S., Pettersson B., Uhlen M.,
Nyren P. Real-time DNA sequencing using detection of
pyrophosphate release. Anal. Biochem. 242(1), 84-89
(1996).

3. "Design and Synthesis of a 3'-O-Allyloxy Photocleavable
Fluorescent Nucleotide as a Reversible Terminator for DNA
Sequencing By Synthesis". H. Ruparel, L. Bi, Z. Li, X.
Bai, D. H. Kim, N. Turro & J. Ju. Proceedings of the
National Academy of Sciences USA 2005, 102, 5932-5937.

4. Edwards, J. R., Itagaki, Y. & Ju, J. Solid Phase
Capturable Dideoxynucleotides for Multiplex Genotyping
Using Mass Spectrometry (2001). Nucleic Acids Res. 29,
e104 (p1-6).

5. Roge G. et al. An indicator displacement system for
fluorescent detection of phosphate oxyanions under
physiological conditions. Tetrahedron Letters 45 (2004)
8721-8724.

6. Ju, J. et al., U.S. Patent 6,664,079.




What is claimed is: 






1. A method for determining the nucleotide sequence of a
single-stranded DNA comprising performing the
following steps for each nucleic acid residue of the
DNA whose identity is to be determined:

(a) contacting the DNA under DNA polymerization-permitting
conditions with (i) a 3'-O-blocked
dNTP selected from the group consisting of
3'-O-blocked dATP, 3'-O-blocked dCTP, 3'-O-blocked
dGTP, and 3'-O-blocked dTTP, and (ii) 9°N DNA
polymerase (exo-) A485L/Y409V;

(b) (i) determining whether pyrophosphate is
generated as a result of step (a), whereby (1)
pyrophosphate generation indicates that
polymerization has occurred and the identity of
the nucleic acid residue in the DNA is that which
is complementary to the 3'-O-blocked dNTP used in
part (i) of step (a), and (2) the absence of
pyrophosphate generation indicates that the
identity of such nucleic acid residue is not that
which is complementary to such 3'-O-blocked dNTP,
and (ii) if pyrophosphate is not generated,
repeating step (a) once, twice or three times as
necessary, wherein in each repetition a
3'-O-blocked dNTP is used which is different from any
3'-O-blocked dNTP already used, and determining,
after each repetition of step (a), whether
pyrophosphate is generated, such generation
indicating that polymerization has occurred and
the identity of the nucleic acid residue in the
DNA is that which is complementary to the
3'-O-blocked dNTP used in part (i) of the repeated
step (a); and

(c) removing from the 3'-O-blocked dNTP polymerized
in step (a) or (b), whichever is applicable, the
moiety blocking the 3'-O atom of the dNTP, with
the proviso that such removing step is optional
in the event that there remains no further
nucleic acid residue of the DNA whose identity is
to be determined.

2. The method of claim 1, wherein determining whether
pyrophosphate is generated in step (b)(i) is performed by
detecting light generated by a luciferase-based
reaction.

3. The method of claim 2, wherein the luciferase is firefly
luciferase.

4. The method of claim 2, wherein the luciferase-based
reaction comprises contacting the pyrophosphate with a
sulfurylase under conditions permitting the generation
of ATP from the pyrophosphate, and contacting the ATP so
generated with a luciferase under conditions permitting
the generation of light by the luciferase in the
presence of ATP.

5. The method of claim 1, wherein determining whether
pyrophosphate is generated in step (b)(i) is performed
by detecting dissociation of a coumarin-derived
indicator from a complex between the indicator and a
bis-Zn2+-dipicolylamine coordination compound, wherein
the coumarin-derived indicator has the following
structure:

[chemical structure not reproduced]

and the bis-Zn2+-dipicolylamine coordination compound,
when in association with the coumarin-derived
indicator, has the following structure:

[chemical structure not reproduced; legible label: 4 NO3-]

6. The method of claim 1, wherein the moiety blocking the
3'-O atom of the dNTP is an allyl moiety or a
methoxymethyl moiety.

7. The method of claim 6, wherein the moiety is an allyl
moiety.

8. The method of claim 1, wherein the DNA is immobilized on
a solid substrate.

9. The method of claim 8, wherein the DNA is immobilized on
the solid substrate via an azido linkage, an alkynyl
linkage, 1,3-dipolar cycloaddition linkage, or
biotin-streptavidin interaction.

10. The method of claim 8, wherein the solid substrate is in
the form of a chip, a bead, a well, a capillary tube, or
a slide.

11. The method of claim 8, wherein the solid substrate is
gold, quartz, silica, or plastic.

12. The method of claim 8, wherein the solid substrate is
porous.

13. The method of claim 4, wherein the moiety blocking the
3'-O atom of the dNTP is an allyl moiety and the
single-stranded DNA is immobilized to a solid substrate.

14. The method of claim 5, wherein the moiety blocking the
3'-O atom of the dNTP is an allyl moiety and the
single-stranded DNA is immobilized to a solid substrate.

15. The method of claim 1, wherein the moiety blocking the
3'-O atom of the dNTP is an allyl moiety and removing it
is performed using Na2PdCl4 and TPPTS.

16. A compound comprising a dNTP having bound to its 3'
oxygen an allyl or methoxymethyl moiety.

17. The compound of claim 16, wherein the moiety is an allyl
moiety.

18. The compound of claim 16, wherein the moiety is a
methoxymethyl moiety.

19. The compound of claim 16, wherein the dNTP is dATP.

20. The compound of claim 16, wherein the dNTP is dCTP.

21. The compound of claim 16, wherein the dNTP is dGTP.

22. The compound of claim 16, wherein the dNTP is dTTP.

23. The compound of claim 16, wherein the compound is a
3'-O-allyl dNTP.

24. The compound of claim 23, wherein the compound is
3'-O-allyl dATP.

25. The compound of claim 23, wherein the compound is
3'-O-allyl dCTP.

26. The compound of claim 23, wherein the compound is
3'-O-allyl dGTP.

27. The compound of claim 23, wherein the compound is
3'-O-allyl dTTP.

28. A compound comprising a dNTP having bound to its 3'
oxygen an allyl derivative or a methoxymethyl
derivative.



29. A process for producing a 3'-O-allyl dNTP comprising:

(a) sequentially contacting a dimethoxytrityl (DMTr)
3' protected nucleoside triphosphate with (i)
3-bromopropene, NaOH and benzene, and (ii) a
suitable solvent; and
(b) sequentially contacting the product of step (a)
with (i) POCl3/(MeO)3P(O), (ii) tributylammonium
pyrophosphate, and (iii) TEAB/NH4OH, so as to
produce the 3'-O-allyl dNTP.

30. The method of claim 29, wherein the suitable solvent of
step (a)(ii) is 3% THF/CHCl3.

31. The method of claim 29, wherein the concentration of
TEAB in step (b)(iii) is about 0.1M.

32. A kit for use in sequencing a single-stranded DNA
comprising:

(a) 3'-O-allyl dATP, 3'-O-allyl dCTP, 3'-O-allyl dTTP,
and 3'-O-allyl dGTP, each in a separate
compartment; and
(b) instructions for use.

33. The kit of claim 32, further comprising a 9°N DNA
polymerase (exo-) A485L/Y409V.

34. The kit of claim 33, further comprising reagents
permitting polymerization.

35. The kit of claim 33, further comprising reagents
permitting pyrophosphate detection using a
luciferase-based reaction.

36. The kit of claim 33, further comprising reagents
permitting pyrophosphate detection using a
coumarin-derived indicator.

37. The kit of claim 33, further comprising reagents
permitting removal of an allyl group from a 3'-O-allyl
dNTP.

Fig. 5



WO 2007/002204    6/19    PCT/US2006/024157

[Structure drawing not reproduced; legible label: 4 NO3-]

Fig. 6







Fig. 7 



iterative addition: -> dATP -> dTTP -> dGTP -> dCTP ->
                    (additions 1, 2, 3, 4)

DNA(n) + dNTP  --Polymerase-->   PPi + DNA(n+1)
PPi            --Sulfurylase-->  ATP
ATP            --Luciferase-->   Light
ATP            --apyrase-->      AMP + 2Pi
dNTP           --apyrase-->      dNMP + 2Pi

Fig. 8
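
The Fig. 8 scheme, in which unblocked dNTPs are dispensed iteratively and each incorporation releases one PPi that is converted to ATP and then to light, can be illustrated with a toy signal model. This is a sketch under assumptions: the function name `pyrogram`, the cycle limit, and the idealized rule that signal equals the number of nucleotides incorporated per dispensation are hypothetical. The dispensation order "ATGC" follows the order shown in the scheme.

```python
# Hypothetical toy model of the Fig. 8 scheme; names and idealizations are assumptions.
def pyrogram(bases_to_incorporate, dispensation="ATGC", max_cycles=50):
    """Return (dNTP, signal) pairs for iterative addition of unblocked dNTPs.

    bases_to_incorporate: the bases the polymerase must add, in order.
    The signal is the number of PPi released, which in this idealized
    model equals the number of nucleotides incorporated.
    """
    position = 0
    signals = []
    for _ in range(max_cycles):
        for dntp in dispensation:
            if position >= len(bases_to_incorporate):
                return signals
            incorporated = 0
            # An unblocked dNTP extends through a whole homopolymer run at once.
            while (position < len(bases_to_incorporate)
                   and bases_to_incorporate[position] == dntp):
                incorporated += 1
                position += 1
            # A dNTP that is not incorporated gives no PPi and is degraded by apyrase.
            signals.append((dntp, incorporated))
    return signals


print(pyrogram("GGGAA"))
```

In this model a run such as GGG produces a single peak of height 3, so the run length has to be read from the peak intensity. The 3'-O-blocked dNTPs of claim 1 instead limit each addition to one nucleotide.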



iterative addition: -> dATP -> dTTP -> dGTP -> dCTP ->

DNA(n) + dNTP      --Polymerase-->  PPi + DNA(n+1)
PPi + R-I complex  -->              PPi-R + I
dNTP               --apyrase-->     dNMP + 2Pi

Fig. 9



3'-O-allyl-dCTP, MW = 502.99

3'-O-allyl-dTTP, MW = 517.99

Fig. 10



[Spectra not reproduced; legible peak labels (mass):]

1) Primer: 6084          Primer + Allyl-dATP: 6436
2) Primer: 6131          Primer + Allyl-dGTP: 6498
3) Primer: 7373          Primer + Allyl-dCTP: 7697
4) Primer: 7996          Primer + Allyl-dTTP: 8339

Fig. 11
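
The peak labels legible in Fig. 11 (primer 6084 and product 6436 for allyl-dATP; 6131 and 6498 for allyl-dGTP; 7996 and 8339 for allyl-dTTP; 7373 and 7697 for allyl-dCTP) can be checked against the mass that one 3'-O-allyl nucleotide should add to the primer. The sketch below is an illustration only and rests on assumptions: the pairing of 7373 with the primer and 7697 with the product in panel 3 is inferred from the garbled labels, the residue masses are standard average values for a deoxynucleotide within a DNA chain, and the peaks are treated as singly charged. The names `PEAKS`, `mass_shifts` and `expected_shift` are hypothetical.

```python
# Hypothetical check of Fig. 11 peak labels; the panel 3 assignment is an assumption.
# Average mass (Da) added per deoxynucleotide residue in a DNA chain (dNMP minus H2O).
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}

# Replacing the 3'-OH hydrogen (1.01 Da) with an allyl group C3H5 (41.07 Da).
ALLYL_SHIFT = 41.07 - 1.01

# (primer mass, extended-primer mass) as read from the figure labels.
PEAKS = {
    "A": (6084, 6436),
    "G": (6131, 6498),
    "C": (7373, 7697),  # assumed assignment: lower value taken as the primer
    "T": (7996, 8339),
}


def mass_shifts(peaks):
    """Observed mass increase on extension, per base."""
    return {base: product - primer for base, (primer, product) in peaks.items()}


def expected_shift(base):
    """Expected increase for adding one 3'-O-allyl nucleotide."""
    return RESIDUE_MASS[base] + ALLYL_SHIFT


for base, observed in mass_shifts(PEAKS).items():
    print(base, observed, round(expected_shift(base), 2))
```

Under these assumptions the observed shifts fall within a few mass units of the expected single-nucleotide additions, which is the pattern expected when each primer is extended by exactly one allyl-blocked nucleotide.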






[Scheme not reproduced. Legible labels: template segment TTCCC;
Allyl-dGTP; Polymerase; PPi detection; Deallylation; HO (on the
added G); Allyl-dGTP; PPi detection; Deallylation; The third round]

Fig. 14






[Bar chart not reproduced. Vertical axis ticks: 0 to 90.
Bar labels: 0.5 nmol dATP; 0.5 nmol allyl-dATP;
1.5 nmol allyl-dATP; 1.5 nmol ddATP]

Fig. 17






74689-a-pct.ST25
SEQUENCE LISTING

<110> The Trustees of Columbia University in the City of New York
      Wu, Jian
      Kim, Dae H

<120> PYROSEQUENCING METHODS AND RELATED COMPOSITIONS
<130> 0575/74689-A-PCT
<160> 10
<170> PatentIn version 3.3

<210> 1
<211> 18
<212> DNA
<213> Artificial Sequence
<220>
<223> chemically synthesized
<400> 1
agaggatcca accgagac                                                18

<210> 2
<211> 18
<212> DNA
<213> Artificial Sequence
<220>
<223> chemically synthesized
<400> 2
cagagccaac ctaggaga                                                18

<210> 3
<211> 58
<212> DNA
<213> Artificial Sequence
<220>
<223> chemically synthesized
<400> 3
tttttatcga attcccggaa acagctatga ccatgcttgc atggtcatag ctgtttcc   58

<210> 4
<211> 5
<212> DNA
<213> Artificial Sequence
<220>
<223> chemically synthesized
<400> 4
ttccc                                                               5

<210> 5
<211> 6
<212> DNA
<213> Artificial Sequence
<220>
<223> chemically synthesized
<400> 5
ttcccg                                                              6

<210> 6
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> chemically synthesized
<400> 6
gtatgcagtt agctgtctgg cc                                           22

<210> 7
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> chemically synthesized
<400> 7
catacgtcaa tcgacagacc ggtttaaagg gccctactgc                        40

<210> 8
<211> 19
<212> DNA
<213> Artificial Sequence
<400> 8
agaggatcca accgagact                                               19

<210> 9
<211> 18
<212> DNA
<213> Artificial Sequence
<220>
<223> primer directed to human p53 gene
<400> 9
agaggatcca accgagac                                                18

<210> 10
<211> 60
<212> DNA
<213> homo sapiens
<400> 10